298 research outputs found

    Deep learning-based 3D local feature descriptor from Mercator projections

    Get PDF
    Point clouds provide rich geometric information about a shape and a deep neural network can be used to learn effective and robust features. In this paper, we propose a novel local feature descriptor, which employs a Siamese network to directly learn robust features from the point clouds. We use a data representation based on the Mercator projection, then we use a Siamese network to map this projection into a 32-dimensional local descriptor. To validate the proposed method, we have compared it with seven state-of-the-art descriptor methods. Experimental results show the superiority of the proposed method compared to existing methods in terms of descriptiveness and robustness against noise and varying mesh resolutions

    Text to image synthesis for improved image captioning

    Get PDF
    Generating textual descriptions of images has been an important topic in computer vision and natural language processing. A number of techniques based on deep learning have been proposed on this topic. These techniques use human-annotated images for training and testing the models. These models require a large number of training data to perform at their full potential. Collecting human generated images with associative captions is expensive and time-consuming. In this paper, we propose an image captioning method that uses both real and synthetic data for training and testing the model. We use a Generative Adversarial Network (GAN) based text to image generator to generate synthetic images. We use an attention-based image captioning method trained on both real and synthetic images to generate the captions. We demonstrate the results of our models using both qualitative and quantitative analysis on popularly used evaluation metrics. We show that our experimental results achieve two fold benefits of our proposed work: i) it demonstrates the effectiveness of image captioning for synthetic images, and ii) it further improves the quality of the generated captions for real images, understandably because we use additional images for training

    A reinforcement learning-based approach for imputing missing data

    Get PDF
    Missing data is a major problem in real-world datasets, which hinders the performance of data analytics. Conventional data imputation schemes such as univariate single imputation replace missing values in each column with the same approximated value. These univariate single imputation techniques underestimate the variance of the imputed values. On the other hand, multivariate imputation explores the relationships between different columns of data, to impute the missing values. Reinforcement Learning (RL) is a machine learning paradigm where the agent learns by taking actions and receiving rewards in response, to achieve its goal. In this work, we propose an RL-based approach to impute missing data by learning a policy to impute data through an action-reward-based experience. Our approach imputes missing values in a column by working only on the same column (similar to univariate single imputation) but imputes the missing values in the column with different values thus keeping the variance in the imputed values. We report superior performance of our approach, compared with other imputation techniques, on a number of datasets

    Moisture evolution, thermal properties and energy consumption of drying spent grain pellets from a blend of some cereals for small scale bio-energy utilization: Modelling and Experimental study

    Get PDF
    © 2022 Springer Nature Switzerland AG. This is the accepted manuscript version of an article which has been published in final form at https://doi.org/10.1007/s13399-022-02846-xA fixed bed convective dryer was used to assess the influence of drying temperature and geometry deformation on moisture and thermo-physical property evolution of solid fraction pellets (spent grain) from wet milling of cereal blends for bio-energy generation for small homes. The aim is to study the physical mechanism of drying the pellets that includes temperature and moisture behaviour, transport phenomena, the response rate to varying process conditions, drying time, and energy utilization which can be applied in the development of a fixed bed dryer for drying the pellets at a lower scale. The modified Cranck's diffusion model was used to study moisture loss by introducing shrinkage. The verification of the model gave the mean absolute error (MAE) for moisture content with shrinkage as 0.0366 - 0.1500 while for without shrinkage was 0.0729 - 0.1500 for 60- 80 oC. The effective moisture diffusivity with integrating shrinkage is lower than non- shrinkage though these values varied with drying time. Fitting the moisture ratio with the exponential drying curve equations shows that logarithmic equations were the best model for drying at 60 and 70 oC while Henderson and Pabis's model was better at 80 oC isothermal drying. Thermophysical analysis showed that the average specific heat capacity ranges from 5423.387 to 5198.197J/kgK while the thermal conductivity ranged from 0.115281to 0.136882W/mK at 60-80 oC. The energy and specific energy consumption ranged from 0.41 to 0.494 kWh and 108.39 to 119.29MJ/kg. The shrinkage ratios, effective diffusivity and energy and specific energy consumption were empirically presented as a function of moisture, temperature and or air velocity variations with a high degree of association.Peer reviewe

    A High-Performance Spectral-Spatial Residual Network for Hyperspectral Image Classification with Small Training Data

    Get PDF
    In this paper, we propose a high performance Two-Stream spectral-spatial Residual Network (TSRN) for hyperspectral image classification. The first spectral residual network (sRN) stream is used to extract spectral characteristics, and the second spatial residual network (saRN) stream is concurrently used to extract spatial features. The sRN uses 1D convolutional layers to fit the spectral data structure, while the saRN uses 2D convolutional layers to match the hyperspectral spatial data structure. Furthermore, each convolutional layer is preceded by a Batch Normalization (BN) layer that works as a regularizer to speed up the training process and to improve the accuracy. We conducted experiments on three well-known hyperspectral datasets, and we compare our results with five contemporary methods across various sizes of training samples. The experimental results show that the proposed architecture can be trained with small size datasets and outperforms the state-of-the-art methods in terms of the Overall Accuracy, Average Accuracy, Kappa Value, and training time

    Relationship detection based on object semantic inference and attention mechanisms

    Get PDF
    Detecting relations among objects is a crucial task for image understanding. However, each relationship involves different objects pair combinations, and different objects pair combinations express diverse interactions. This makes the relationships, based just on visual features, a challenging task. In this paper, we propose a simple yet effective relationship detection model, which is based on object semantic inference and attention mechanisms. Our model is trained to detect relation triples, such as , . To overcome the high diversity of visual appearances, the semantic inference module and the visual features are combined to complement each others. We also introduce two different attention mechanisms for object feature refinement and phrase feature refinement. In order to derive a more detailed and comprehensive representation for each object, the object feature refinement module refines the representation of each object by querying over all the other objects in the image. The phrase feature refinement module is proposed in order to make the phrase feature more effective, and to automatically focus on relative parts, to improve the visual relationship detection task. We validate our model on Visual Genome Relationship dataset. Our proposed model achieves competitive results compared to the state-of-the-art method MOTIFNET

    SC-CAN: Spectral Convolution and Channel Attention Network for wheat stress classification

    Get PDF
    Biotic and abiotic plant stress (e.g., frost, fungi, diseases) can significantly impact crop production. It is thus essential to detect such stress at an early stage before visual symptoms and damage become apparent. To this end, this paper proposes a novel deep learning method, called Spectral Convolution and Channel Attention Network (SC-CAN), which exploits the difference in spectral responses of healthy and stressed crops. The proposed SC-CAN method comprises two main modules: (i) a spectral convolution module, which consists of dilated causal convolutional layers stacked in a residual manner to capture the spectral features; (ii) a channel attention module, which consists of a global pooling layer and fully connected layers that compute inter-relationship between feature map channels before scaling them based on their importance level (attention score). Unlike standard convolution, which focuses on learning local features, the dilated convolution layers can learn both local and global features. These layers also have long receptive fields, making them suitable for capturing long dependency patterns in hyperspectral data. However, because not all feature maps produced by the dilated convolutional layers are important, we propose a channel attention module that weights the feature maps according to their importance level. We used SC-CAN to classify salt stress (i.e., abiotic stress) on four datasets (Chinese Spring (CS), Aegilops columnaris (co(CS)), Ae. speltoides auchery (sp(CS)), and Kharchia datasets) and Fusarium head blight disease (i.e., biotic stress) on Fusarium dataset. Reported experimental results show that the proposed method outperforms existing state-of-the-art techniques with an overall accuracy of 83.08%, 88.90%, 82.44%, 82.10%, and 82.78% on CS, co(CS), sp(CS), Kharchia, and Fusarium datasets, respectively
    corecore